A Multilingual Polarity Classification Method using Multi-label Classification Technique Based on Corpus Analysis

نویسنده

  • Yohei Seki
چکیده

In NTCIR-7 MOAT, we participated in four sub-tasks (opinion & holder detection, relevance judg-ment, and polarity classification) at two languagesides: Japanese and English. In this paper, we fo-cused on the feature selection and polarity classifi-cation methodology in both languages. To detectopinion and classify the polarity, the features wereselected based on a statistical χ-square tests overNTCIR-6 and MPQA corpora. We also comparedseveral multi-label classification methods to clas-sify positive, negative, and neutral polarity. Theevaluation results suggested that the coverage ofthe features in Japanese was acceptable for theopinion analysis in newspaper articles, but therewas still a room for improvement in the coverageof the features in English. We also found the resultof SVM voting approach was slightly better thanthe results of Multi-label classification approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting Associations between Class Labels in Multi-label Classification

Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...

متن کامل

NSIGHT-1 at SemEval-2016 Task 5: Deep Learning for Multilingual Aspect-based Sentiment Analysis

This paper describes our deep learningbased approach to multilingual aspectbased sentiment analysis as part of SemEval 2016 Task 5. We use a convolutional neural network (CNN) for both aspect extraction and aspect-based sentiment analysis. We cast aspect extraction as a multi-label classification problem, outputting probabilities over aspects parameterized by a threshold. To determine the senti...

متن کامل

Domain Adaptation for Opinion Mining: A Study of Multipolarity Words

Expression of opinion depends on the domain. For instance, some words, called here multi-polarity words, have different polarities across domain. Therefore, a classifier trained on one domain and tested on another one will not perform well without adaptation. This article presents a study of the influence of these multi-polarity words on domain adaptation for automatic opinion classification. W...

متن کامل

A Threshold Based Multi-Label Classification

In classification problems, a pattern may belong to one or multiple categories. It is essential to deal multi-label classification accurately and efficiently. Threshold strategies can be used for multi-label classification. We propose four schemes to compute threshold for a threshold based multi-label classification. We validate our method using multi-label text data and multi-label image data....

متن کامل

INSIGHT-1 at SemEval-2016 Task 5: Deep Learning for Multilingual Aspect-based Sentiment Analysis

This paper describes our deep learningbased approach to multilingual aspectbased sentiment analysis as part of SemEval 2016 Task 5. We use a convolutional neural network (CNN) for both aspect extraction and aspect-based sentiment analysis. We cast aspect extraction as a multi-label classification problem, outputting probabilities over aspects parameterized by a threshold. To determine the senti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008